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A data depth measures the ccntrahty of a point with respect to 
an empirical distribution. Postulates are formulated, which a depth 
for functional data should satisfy, and a general approach is proposed 
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approach, mentioned as <I>-depth, is based on depth infima over a 
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'— ' are established for the $-depth and a generalized version of it. The 
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1 Introduction 

In multivariate data analysis a depth function measures how 'deep' a point is 
located in a given data cloud in Euclidean d-space, that is, how close it is to 
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an implicitly defined 'center' which, in turn, has maximal depth. Notions of 
data depth in M.'^ are closely related to those of multivariate quantiles, central 
ranks and outlyingness. The upper level sets of a depth function form central 
regions that refiect the location, scale and shape of the given distribution. 
By this, data depth has become a powerful tool of nonparametric analysis in 



Many notions of multivariate data depth have been proposed in the litera- 



ture, starting with Tukey (1975) and Liu (1990). They have been successfully 
applied to problems of nonparametric statistical analysis in M*^ such as classi- 
fication (or supervised learning), hypothesis testing, and others. A theory of 
depth functions in M'^ has been developed that includes population versions 
(that is, depth with respect to a probability distribution) as well as basic 



postulates and other properties shared by these notions; see Zuo and Serfiing 



(2000), Mosler (2002), Dyckerhoff (2004) and the recent surveys by Serfiing 



(2006) and Cascos (2009). For exemplary applications refer to Liu et al 



(1999), Li et al. (2012), and Dutta and Ghosh (2012). Also several depth 



notions have been proposed for functional data, e.g., by Fraiman and Muniz 



(2001), Cuesta-Albertos and Nieto-Reyes (2008b), Lopez-Pintado and Romo 



(2009), and Claeskens et al. (2012). Intended applications include problems 



of classification, outlier detection and the trimming of functional data. 

What is still missing is a theory of depth functions in functional spaces, in 
particular a general definition based on proper postulates to be imposed on a 
functional depth. The present paper contributes to this issue in two respects. 
First, by formulating a set of minimal postulates for a depth function in Ba- 
nach spaces and, second, by introducing a comprehensive class of functional 
depths, named ^-depths. The postulates are weak enough to generate non- 
trivial depth notions. They are contrasted with further postulates, weaker 
as well as and stronger ones. Especially, in the case of symmetrically dis- 
tributed data, the postulates imply that the depth takes its maximum at the 
center of symmetry. A functional data depth D generates central regions 
Da, consisting of all functions that have at least a certain depth a G [0, 1]. 
These regions describe the data cloud regarding its location, variation and 
functional shape; they satisfy similar postulates. The definition of $-depths 
is based on d-variate depth infima over a properly chosen set $ of M'^-valued 
linear functions. Each if E ^ may be regarded as a particular 'view' on the 
data or 'aspect' of them. This can be the projection of the data distribution 
to some finite-dimensional marginal. An interesting question is whether the 
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projection of a central region Da is equal to the respective central region of 
the projection of the data. We provide a necessary and sufficient condition 
for this. 

By specializing the set $ many different notions of depth are obtained. As 
important subclasses of <I>-depths we introduce the general graph depths and 
the grid depths. The first mentioned include known functional depths like 



the band depth (Lopez-Pintado and Romo 2009) and the half-graph depth 



(Lopez-Pintado and Romo 2005), as well as many others. The location- 
slope graph (or grid) depth is a bivariate depth that operates on functions 
and their first derivatives simultaneously. It can be used to analyze warped 
functional data by incorporating their slope together with a warping function 
that has been estimated from the data. Two extensions of the •I'-depth 
suggest themselves: to take a weighted infimum of (i-variate depths and to 
make the set $ dependent on the data. Both extensions come out to be 
compatible with the basic postulates. The latter gives rise to the notion of 
principal component depth, where an m-variate data depth is applied to the 
loadings of the first m principal components. 

Overview of the paper: Section 2 gives a short account of data depth in 
finite dimensions and its basic properties. In Section 3 a set of postulates 
is formulated that define a general functional data depth; also additional 
postulates are given that may be satisfied in special cases. The class of 
^-depths is introduced and its properties are derived. Section 4 discusses 
restrictions to be imposed on $ that are specific to the functional data setting. 
Special classes of $-depths are discussed in Section 5. Section 6 presents the 
generalized $-depth and, in particular, the principal component depth, while 
Section 7 gives an outlook on population versions and alternative approaches. 



2 Multivariate data depth 

First let us recapitulate the notion of a depth for data in finite-dimensional 
space M'^. A multivariate (d-variate) data depth is a bounded function that, 
to a given data cloud X = {x^, . . . C M'^ and a point y G M"^, d > 1, 
assigns a depth value D{y\X) = D{y\x^, . . . G M that satisfies certain 
postulates, that is, desirable properties. For the present inquiry we use the 



following set of postulates, which is due to Dyckerhoff (2002): 
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• Dl Translation invariant: D(z+b\x^+b, . . . , = D{z\x^, . . . , x") 
for all 6 G M'^ , 

• D2 Linear invariant: D(Az\Ax^, . . . ,Ax'^) = D( ) for ev- 
ery regular matrix A G M.'^^'^ , 

• D3 Null at infinity: limpn^oo D{z\x\ . . . , x") = . 

• D4 Monotone on rays: If a point z* has maximal depth, that is 
D{z*\x^, . . . , x") = max^gjjd D(2;|x^, . . . , x") , then for any r in the unit 
sphere S*"^"^ the function a i-^ /^(z* + ar\x^, . . . ,x^) decreases with 
a > , 

• D4C Quasiconcave: D{-\x^, . . . ,x^) is a quasiconcave function, that 
is, its upper level sets Da{x^, . . . ,x") = {2; G M'' : D{z\x^, . . . ,x") > a} 
are convex for all a > . 

• D5 Upper semicontinuous: The upper level sets Da{x^, ■ ■ ■ ,x") are 

closed for all a > . 



Slightly different postulates have been given by Liu (1990) and Zuo and 



Serfling (2000). The main difference between these postulates and those 



above is that they refer to a center of symmetry at which depth should attain 
its maximum and that they do not require upper semicontinuity (which serves 
as a useful technical restriction). At the end of Section [s] we will come back 
to the behavior of a depth under certain notions of symmetry. 

For a > the level sets Da{x^, ■ ■ ■ ,x"') form a nested family. They are 
mentioned as depth trimmed regions or central regions, with a measuring the 
degree of centrahty. The above postulates can be equivalently formulated in 
terms of these regions. Dl and D2 say that the family of central regions 
is equivariant against shifts and changes of scale, respectively. D3 means 
that for any a > the region Dq,(x^, . . . , x") is bounded. D4 states the 
starshapedness of each Da{x^, . . . , x") with respect to z*. D4C and D5 say 
that each region is convex and closed, respectively. Obviously, as a convex 
set is starshaped with respect to each of its points, D4C implies D4. 

Depth trimmed central regions describe a data cloud x^, . . . ,x" G M'^ with 
respect to location, dispersion, and shape. This has many applications in 
multivariate data analysis as well as inference. By definition a ci-variate 
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data depth is bounded. If there is a point of maximum depth, this depth 
will w.l.o.g. be set to 1. Then the innermost level set arises at a = 1, and 
Di{x^, . . . , x") is the set of deepest points. 

More general, in place of the data cloud a probability measure /i or a random 
vector X on M.'^ can be considered. This is mentioned as a multivariate depth. 
In turn, apphed to an empirical distribution fi that gives equal probabilities 
to the points multivariate data depth is obtained. 

Important examples of multivariate depth functions are, among many others, 
the Mahalanobis depth, the Tukey (or halfspace) depth, the simplicial depth 



(Liu, 1990), the projection depth (Liu (1992), Zuo and Serfiing (2000)), and 



the zonoid depth (Koshevoy and Mosler 1997). The various depths proposed 
in the literature all satisfy the postulates Dl to D3 (some only orthogonal 
invariance in place of linear one), while the remaining postulates are met to 
a different extent. They show different additional features regarding their 
practical apphcability (like computability and robustness) as well 
lytical properties (like continuity and characterization through marginals), 
which permit their application in different statistical tasks. 



3 <l>-depth for functional data 

Consider a Banach space E having some norm || ■ ||. Let E' be the dual 
space of all continuous linear functionals endowed with the operator norm 
\\(p\\' = suppii^i lv'(2^)l) Be and Be' be the unit balls {Se and Se' the unit 
spheres) of E and E', respectively, and B the Borel sets of E. Prominent and 
practically relevant examples for E include spaces of function^ mapping a 
compact interval J to Euclidean space M'^, in particular: 

• the space C{J) of real- valued continuous functions x : J — )■ M with a 
norm ||x||oo = sup^gj |a;(t)|, 

• the space L'^{J) of real- valued square-integrable functions a; : J — )■ M 
with a norm ||x||2 = {Jjx'^(t)dtY^'^, 

^For an K*^ -valued function x we notate x{t) — {xi{t), . . . , Xk{t)). 



5 



• the space of M*^- valued continuous functions on J, C ( J; M*^) , endowed 
with the norm ||x||oo = sup^gj where || ■ || is an arbitrary norm 
on M'^, 

• the space of real-valued m-times continuously different iable functions 

on J, C""(J), with the norm = maXs<m{||x*^''^||oo}- 

• the space P' of sequences x = {x{t))tm in ^ that have finite norm 
I klip — (X^t^i \^{'t)\P)p, for some p, 1 < p < oo, 

A functional data depth is a real- valued functional that, given a finite data 
cloud of elements in E, indicates how 'deep' another given element of E is 
located in the data cloud, that means, how 'close' it is to the 'center' of the 
cloud. Of course, the meaning of 'deep', 'close' and 'center' are implicitly 
determined by the functional depth. In the sequel we investigate functional 
depths of a general form that is given in Definition [l| This definition includes 
several notions of functional data depth that are known from the literature 
as well as many new ones. Some other existing notions, that are not covered 
by our definition, will be addressed in Section [7] below. 

Definition 1 ($-depth) For G E, define 

D{z\x\ . . . , x") = inf D\v{z)\v{x'), . . . , ^(x")) , (1) 

where D'^ is a d-variate data depth satisfying Dl, D2, D3, D4, and D5, 
$ c E"'' , and E"^ is the space of continuous linear functions E — >■ . D is 
called a $-depth. 

For short we will notate the data clouds by X = {x^, . . . , x"}, ^{X) = 
Wix^), . . . , ¥?(x")}, and similarly AX = {Ax^ . . . Ax"}, AX = {Ax\ . . . Ax"}, 
etc.. So the defining equation ([T]) becomes 

D{z\X) = mlD'^{^{z)MX)) . 

Each if in our definition may be regarded as a particular aspect we are 
interested in and which is represented in d-dimensional space. The depth of 
z is given as the smallest multivariate depth of z under all these aspects. It 
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implies that all aspects are equally relevant so that the depth of z cannot 
be larger than its depth under any aspect. For example, '^[^z) may be the 
evaluation of a function z G -Z^^(0, 1) at some t G [0, 1] or another projection 
of 2 to a finite-dimensional space. 

As the (i-variate depth D'^ takes its maximum at 1, the functional data 
depth D is bounded above by 1. At every point z* of maximal D-depth 
it holds D{z*\X) < 1. The bound is attained with equality, D{z*\X) = 1, iff 
D'^{ip{z*)\(p{X)) = 1 holds for all <^ G $, that is, iff 

z* e f] ^-\Df{^{X))) . (2) 

Next we like to formulate postulates which a meaningful definition of func- 
tional depth should reasonably satisfy and check their eventual restrictions 
implied by them on the class $ in Definition [T] Our first postulates extend 
the multivariate postulates Dl to D5 to the general setting; they involve 
elements of an arbitrary Banach space. Further postulates are specific to 
spaces of functions on a real interval. 

• FDl Translation invariant: D{z + b\X + b) = D{z\X) for all 6 G E , 

• FD2 Scale invariant: D{\z\\X) = D{z\X) for all A > , 

• FD3 Null at infinity: lim||^||^oo ^(^|^) =0, 

• FD4 Monotone on rays: For any z* with D{z*\X) = 1 and any 
r G Be the function a t-)- D{z* + ar\X) decreases with a > , 

• FD4con Quasiconcave: The upper level sets Da{X) = {z E E : 
D{z\X) > a} are convex for all a > 0. 

• FD5 Upper semicontinuous: The sets Da{X) are closed for all 
a > . 

The invariance postulates FDl and FD2 say, which aspects of the data 
a functional depth should not reflect, while FD3 to FD4con essentially 
postulate that the depth function is unimodal and decreases from maximum 
at its mode to zero at infinity. Again, FD4con implies FD4. A functional 
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depth that satisfies the stronger postulate FD4con is named a convex depth. 
FD5 is a technical assumption. Postulate FD2 may be strengthened to full 
linear invariance, 

• FD2L Linear invariant: D{Az\AX) = D{z\X) for every isomor- 
phism A : E ^ E (that is, for every linear continuous transformation 
A: E ^ E that has a continuous inverse) , 

which corresponds to the postulate D2 of multivariate depth. FD2L appears 
to be a rather strong restriction; it implies that the depth is invariant against 
all hnear isometrics of E. Moreover, when £■ is a function space like C(J) 
with II ■ lloo, e.g. all transformations of type A^x — x — k{-,t)x{t)dt (with 
some kernel |A;| < 1) are included in the invariance postulate. Also, the depth 
then is invariant against any rearrangement Ap of the functions, Apx(t) = 
{x o p){t)), with an arbitrary bijection p : J ^ J. We formulate the latter as 
a two separate postulates: 

• FD2R Rearrangement invariant: 

D{z o p\X op) — D{z\X) for every bijective function p : J ^ J , 

• FD2IR Increasing rearrangement invariant: D[z o p\X o p) — 

D[z\X) for every increasing bijective function p : J ^ J ^ 

and FD2R implies FD2IR. We continue with another postulate that is 
specific to function spaces. Let £■ be a Banach space of functions J — > R*^, 
where J is a real interval, and notate by £'0 the subset of functions a & E 
that vanish nowhere a{t) ^ for all t & J. 

• FD2F Function-scale invariant: If E is equipped with a product, 
say, the pointwise product '•' of functions, it holds D{a ■ z\a ■ X) = 
D{z\X) for all a e Eo . 

Obviously, both function-scale invariance FD2F and full linear invariance 
FD2L are stronger than scale invariance FD2. 

Now we proceed to our first main result, which says that a $-depth indeed 
satisfies the relevant postulates. 
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Theorem 1 (i) A ^-depth (Tp always satisfies FBI, FD2, FD4, and FD5. 

(ii) The depth satisfies FD3 iff for every sequence (2;*) with \ \z^\ \ — )■ 00 exists 

a if in such that (p{z^) — ?■ 00 . 

(iii) It satisfies FD4con if the underlying d-variate depth satisfies D4C. 

The proof is given in the Appendix. When the functional depth is univariate 
[d = 1) and aU functions in $ are increasing, an additional monotonicity 
postulate is satisfied, which refers to the pointwise order of functions: 

Proposition 1 Let d = 1 and all ip E ^ be increasing. Then the ^-depth 
|7]j satisfies the postulate FD4pw, 

• FD4pw Monotone: For every z* E E having maximal depth it holds 
that D{y\X) > D{z\X) whenever either z* < y < z or z < y < z* . 

Proof. Let z* have maximal depth and assume z* < y < z. Then (f{z*) < 
^(y) < foi' clU (as ip is increasing) and 

D\^iy)\^iX)) > D\^iz)\^iX)) 

due to D4. Taking the infimum on the left and right hand sides yields 
D{y\X) > D{z\X) . The case z* > y > z is similarly shown. <) 

We conclude this section with properties of the trimmed regions that originate 
from a functional depth. First we provide a list of postulates on the family 
{Da{X)}a which are equivalent to the above postulates FDl to FD5 on the 
depth D: For all X = {x^, . . . , x"} G ^ and a > 0, 

m FRl: Da{X + b) = Da{X) + b , beE, 

• FR2: D^{XX) = XD^{X) , A > , 

• FR3: D^{X) is bounded if a > 0, 

• FR4: Da{X) is starshaped with respect to all z* e fl^e* V'^i^iivi^))) > 

• FR4C: Da{X) is convex. 
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• FR5: Dc,{X) is closed. 

Further, for any family of depth trimmed central regions we obtain the equa- 
tion 

D„(X) = fl Z}^(X) , ae]0,l], (3) 
which is obvious as it holds for the upper level sets of any function. 



Theorem 2 Let D he a functional depth D according to Definition ([ijj. For 
the level sets Da of D it holds 

Da{X)=^^-\Di{^{X))). (4) 

</3G<I> 

The theorem is proven in the Appendix. Observe that the right hand side 
in Equation Q can be empty for some < 1 and, consequently, for all 
a G [ao,l]- This happens also ii E = and c? = 1, i.e., if the depth 
([T]) is a multivariate depth; e.g., the halfspace depth. Da is nonempty for 
all a G [0, 1] iff the set ^^^^^-^{Df{ip{X))) is not empty, that IS, some z 
exists satisfying 

As it was mentioned above, other sets of postulates for a multivariate depth 
require that the depth should be maximal at some center of symmetry. Dif- 
ferent notions of symmetry with respect to the origin have been considered 
in the context of special data depths: central symmetry, angular symme- 



try (Liu, 1990), and halfspace symmetry (Zuo and Serfling, 2000), and the 



following postulate has been formulated. 

• FD4center: If the distribution X is symmetric (in a proper sense) 
about some z'^, then the depth is maximal at z'^. 



As our notion of functional data depth is based on linear functions 99 into 
M*^, the maximality of a cZ-variate data depth at a point of either central 
or angular (or halfspace) symmetry carries over immediately to the same 
maximality of the functional data depth; this holds for any choice of $: 

Proposition 2 If X is centrally symmetric about some z'^ E E and there 
exists some z* & E with D{z*\X) = 1, then the ^-depth D is maximal at z'^ . 
The same holds in the cases of angular and halfspace symmetry. 
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4 Restrictions on ^ 



We now proceed with specifying the set $ of functions and the multivariate 
depth in While many features of our functional data depth 

resemble those of a multivariate depth, an important difference must be 
pointed out: In a general Banach space the unit ball B is not compact, and 
properties FR3 and FR5 (or equivalently FD3 and FD5) do not imply that 
the level sets of a functional data depth are compact. 

We start with an example which demonstrates the need for imposing strong 
restrictions on $. 

Example (Tukey functional depth): Let E be an infinite dimensional 
Banach space. With $ being the whole dual space, $ = E', and the 
univariate halfspace depth TD^, for ^, ^i, . . . , ^„ G M, 

TD\C\^,, . . . , a) = - min{#{z : e. < C}, : 6 > C}} , (5) 

n 

we obtain the Tukey functional depth TD. It satisfies all postulates FDl to 
FD5 and even the stronger FD4R and FD2L (as G $ =^ Ao e 
However this depth is not really meaningful. First, note that the depth 
vanishes outside the convex hull of the data, which has affine dimension 
< n — 1. Moreover, consider a measure /i on {E, B) that has zero probability 
on all finite dimensional linear manifolds (e.g. E = 1^ and an infinite 
product of L-continuous measures on the reals); then TD{- ■ ■ . . . , ^„) = 
/i-almost surely in E. That is, the data version of the Tukey functional depth 



collapses to zero with probability one. Dutta et al. (2011 ) provide an example 



where this happens as well with the population version. 

The reason why the definition of the example collapses is that the dual space 
E' is too large, or, put another way, the unit ball Be of E is not compact. So, 
to obtain a meaningful notion of functional data depth of type ([T| one has 
to carefully choose a set of functions $ that is not too large. On the other 
hand, $ should not be too small, in order to extract sufficient information 
from the data. 

Each 9? e $ corresponds to an aspect of the data that is transformed into 
(i-space. An interesting question is whether the transformation of a depth- 
trimmed region always coincides with the respective region of the transformed 
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data. For this, we introduce the following restriction, which we call the 
surjection property. 

Definition 2 (Surjection property) A functional data depth ([7p has the 
surjection property if, given a cloud X , for every (^j G $ and y there 
exists some z G '^^'^{y) so that 

D{z\X) = D\y\v{X)) . 



Loosely speaking, the surjection property says that the restriction of every 
transformation to points of equal functional and d-variate depth is surjec- 
tive. In the special case of D being a fc-variate data depth {E = M'^) and $ 
a set of real- valued functionals {d = 1), the surjection property becomes the 
strong projection property of depth D; see 



Dyckerhoff (2004). 



Theorem 3 A functional data depth ([ijj satisfies the surjection property 

if and only if for any G $ 

V{D^{X)) = Dii^iX)) (6) 



For proof see the Appendix. 

Examples of functional depths that satisfy the surjection property will be 



given below; see Section 5.1 and 5.3 



5 Special classes of <l>-depths 

This section presents examples of $-depths, where the set $ of aspects is 
chosen in a special way. The Banach space be a space of functions that 
live on a bounded interval J. In the first group of examples, called general 
band depths, the aspects correspond to the points of some subset of J, which 
may be finite or not, and the depth of a function is evaluated at all these 
points. In another group of examples, mentioned as grid depths, a function 
is evaluated on a fc-point grid and the aspects correspond to directions in 
M'^. In both approaches derivatives of the function may be included, which 
results in depths that measure similarities regarding the level as well as the 
slope and possibly higher derivatives of the function. 
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5.1 General graph depth 



Consider E = C {J;R'^) with norm 11 ■ 1 1 and let 

$ = {<^* : E ^ = Mt) ■ ■ ■,Xci{t)),te T} (7) 

for some T G J, which e.g. may be a subinterval or a finite set in J. For 
D"^ use any multivariate depth that satisfies Dl to D5. This results in the 
graph depth 

GD{z\x\ . . . , x") = MD'^{z{t)\x\t), . . . . (8) 

Proposition 3 A graph depth satisfies the postulates FDl to FD5 and, 
in addition, FD2, FD2IR and FD2R. It satisfies FD4con if D'^ satisfies 
FD4con. 



The latter property FD2R means that the development of functions in time 
is irrelevant to a graph depth. In particular, with the univariate halfspace 
d 



depth, 



1 and T = J we obtain the half graph depth (Lopez-Pintado 



and Romo, 2005). Also, with the univariate simplicial depth the band depth 



(Lopez-Pintado and Romo, 2009) is obtained, but this, in general, violates 



monotonicity FD4. The Proposition [3] is proven in the Appendix. 

We illustrate the notion of halfgraph depth by applying it to a wellknown 
data set from Ramsay and Silverman (2005). Figure 1 exhibits the data. 



which describe two angles (hip and knee) of a gait cycle measured over 20 
time periods at 39 children, and the borders of central regions GD^"'^^^^"'^'^ 
(fat lines) for a = .25 and a = .5 , the latter consisting of a single 'deepest' 
function. The bivariate halfspace depth and the central regions have been 



computed with the algorithms given in Rousseeuw and Ruts (1996) und Ruts 



and Rousseeuw (1996). Figure 2 presents central regions of the hip data for 
a e {0.02,0.08,0.14,0.2}. Note that the largest of these regions contains 
all data as a = 0.02 < 1/39. Finally, Figure 3 exhibits those data that are 
not included in the depth trimmed regions, and therefore can be regarded as 
outliers at different levels a. 

Further, we interpret the above data set as two-dimensional: for every time 
point we have two real values, the hip and the knee angle. We choose the 
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Figure 1: Central regions of ot G {0.25,0.5}; hip and knee. 



bivariate halfspace depth as the underlying multivariate depth. Figure 4 
presents central regions of the graph depth for the hip knee data set. 

Under certain conditions the surjection property is true and hence equation 




Proposition 4 The graph depth satisfies the surjection property if the un- 
derlying multivariate depth is continuous in the data. Here is D'^ continuous 
as a function x M"^'" R. 

Proof. We have to show that for all data clouds X, t^, & T and ?/ G M*^ 
exists some z E E with z{t) = y and 

inf D\z{t)\X{t)) = D\z{t*)\X{t*)). (9) 

However, as 2; is a continuous function and the depth D'^ is continuous in the 
data, equation ^ holds for all z. 

For example, the (i-variate zonoid and Mahalanobis depths are continuous in 
the data, while the halfspace and simplicial depths are not. 
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Figure 2: Central regions of GD^ff^raph^ ^ ^ {0.02,0.10,0.15,0.20}; hip 
angle. 



5.2 Location-slope graph depth 

As the $-depth is a multivariate functional depth, we may also apply it to 
a function and its derivatives. The simplest case is considering a univariate 
function x together with its first derivative x' . Then the bivariate functional 
depth measures how similar a given function is to a cloud of functions in 
terms of location and slope. In the framework of general graph depths this 
is easily done as follows. 

Consider E = C^{J;M.) with a proper norm, e.g. ||2;|| = H^Hoo + lk'||oo- Let 
d = 2 and 

$ = ^\x) = ix{t),x\t)y,t G T} (10) 
for some T G J. For use any bivariate depth that satisfies Dl to D5. 
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Figure 3: Outlying data w.r.t. GD'^"-^^^^'^^'^, hip angle. 




Figure 4: Central regions of bivariate GD^^'^^'P"^^, a G {0.02, 0.05, 0.10, 0.12}; 
hip and knee. 



This results in the location- slope depth 

Lsi,(.m^i,>fz,^((!W)|(^w)). („, 
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numerical derivative of hip 



Figure 5: Location and slope, univariate (right panel) and bivariate (left 
panel), of the hip angle. 



Figure 5 exhibits the development of the hip angle and its derivative in time 
separately (right panel) and as a bivariate function (left panel). For these 
data the location-slope graph depth has been calculated with an underlying 
bivariate halfspace depth. Figure 6 shows, for different choices of a, the 
central regions of this location-slope depth. 

Proposition 5 The location- slope graph depth satisfies the postulates FDl 
to FD5 and, m addition, FD2F. Moreover FD2R and FD2IR hold too if 
the rearrangement function p is differentiahle. 



For proof see the Appendix. An important application of a location-slope 
depth is the analysis of registered functional data. Assume that we ob- 
serve time-warped functions, Xi{t) = yi{h~^ (t)) , t G [0,1], where the yi are 
time-synchronized and the warping functions hi are obtained by standard 



procedures from the observed Ramsay and Silverman (^2005|. Then 



we may investigate depth and deepest points of the bivariate functions 

{xi{t), h,{t)),te [0,1]- 



see also Claeskens et al. (2012) 
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numerical derivative of hip time numerical derivative of hip time 



Figure 6: Central regions of location-slope depth (based on bivariate halfs- 
pace depth), a G {0.02,0.05,0.10,0.12}; hip angle. 



Along the same lines we may construct ^-depths that include higher order 
derivatives as well as multivariate functions. In applications, by those depths 
it can be measured how closely a function follows the dynamics of a cloud of 
of functions. 

5.3 Grid depth 

This section introduces the grid functional depth, which is based on a special 
filtering of the data: The functions are evaluated on a fixed grid. While for 
the graph functional depth the functions are considered on the whole interval, 
J, or some subset of it, for the grid functional depth we restrict on values at 
some given points ti, . . . , in J. 

Let E = C(J;]R'^) with norm || ■ ||oo. We choose a finite number of points 
in J, ti,...,tk, and evaluate a function z E E at these points. Notate 
t = {ti, . . . ,tk) and z{t) = {zi{t), . . . , Zd{t))'^ . That is, in place of the function 
z the k X d matrix z^'''^ is considered. A grid depth RD is defined by (1) 
with the following $, 

^ = {ip^: if^iz) = ((r, . . . , (r, z,m, r G S'-'} , (12) 
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which yields 

RD{z\x\...,x'') = inf D\{r,z{t))\{r,x\t)),...,{r,x''{t))). (13) 
From Theorem (1) follows: 

Proposition 6 The class of grid depths satisfies FDl to FD5, with FD3 
restricted to — t- oo. 

Obviously, a grid depth is not invariant to arbitrary or increasing rearrange- 
ments (FD2R or FD2IR), but it is invariant to permutations of {ti, . . . , tk}- 
Also it is not function-scale invariant (FD2F). 

When d = 1 the grid depth can be seen as a multivariate depth in satis- 



fying the weak projection property (Dyckerhoff (2004)), 



D\z\x\...,x'') = inf D\{r,z)\{r,x^),...,{r,x'')). 

r(zsk-i 

In the case d = 1 the grid depth satisfies the surjection property if and only 
if for all X, r* and y there is some z & E with (r*, z) = y and 

RDiz\X) = inf D\{r,zm{r,x\i:)),...{r,x^m 

j.g5fc-i 

= D\y\{r\x\i:)),...{r*,x''m. 
This restriction holds, e.g., for the Mahalanobis and the zonoid depths but 



not for the halfspace depth; for a counterexample, see Dyckerhoff (2004). 



A location-slope grid depth is defined in the same way as the location- 
slope graph depth. Also higher derivatives can be included into the notion 
of grid depth. We omit the details. 



5.4 Choosing the multivariate depth 

In applying the $-depth to a real data problem, we have to make two choices: 
selecting a proper set $ of aspects and choosing an underlying multivariate 
data depth D'^. As we have seen, properties of a $-depth like quasiconcavity 
(FD4con) and the surjection property depend on the choice of V^. However, 
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most important in the selection of D'^ are the questions of computabihty and 
- depending on the data situation - robustness. Mahalanobis depth is solely 
based on estimates of the mean vector and the covariance matrix. In its 
classical form with moment estimates Mahalanobis depth is efficiently calcu- 
lated but highly non-robust, while with estimates like the minimum volume 
ellipsoid it becomes more robust. However, since it is constant on ellipsoids 
around the center, Mahalanobis depth cannot reflect possible asymmetries 
of the data. Zonoid depth can be efficiently calculated, also in larger di- 
mensions, but has the drawback that the deepest point is always the mean, 
which makes the depth non-robust. So, if robustness is an issue, the zonoid 
depth has to be combined with a proper preprocessing of the data to iden- 
tify possible outliers. The halfspace depth is, by construction, very robust 
but expensive when exactly computed in dimensions > 3. As an efficient 
approach the random Tukey depth can be calculated, where the minimum 
of univariate depths in several random directions is determined. This yields 
an upper bound of the halfspace depth; however the number of directions 
has to be somehow chosen. Further qualified candidates, among others, are 
projection depths and - albeit being only mirror symmetric - L^-depths. 



6 Extensions, principal component depth 

More functional depths can be constructed with a generalized versions of 
Definition |l| In Q we may introduce weights w^p > that reflect the relative 
importance of 'direction' This obviously does not affect the validity 

of the above postulates FDl to FD5. 

Definition 3 (Weighted functional data depth) 

Diz\X)=mlw^D\^iz)\^ix'),...,^ixn). (14) 

Also the set $ may be made dependent on the data. This is done in the next 
depth notion, the principal component depth. 

Let E he a separable Hilbert space, e.g. the space C?{J) of all square- L- 
integrable functions or the space £^ of square-summable sequences. Given 
x^, . . . G E flrst a principal component (PC) analysis is performed; see 



20 



Original data curves PC 1 (70,6 % of variability) 




2 4 6 8 10 12 14 16 18 20 2 ■* 6 B 10 12 14 16 18 20 



PC2 {12% of variability) PC 3 (8,4 % of variability) 




time 



Figure 7: Data and first three principal components; hip angle: Mean func- 
tion (solid line), sum of (dash-dotted line) and difference between (dashed 
line) mean function and principal component. 



Ramsay and Silverman (2005). Let yi, . . . ,ym denote the first m principal 
components and 

m 

i=i 

be the least-squares approximation of z G -E. Define 

$ = $(X) = : cfiz) = (r, 7(^)), r G 5™-^} , (15) 

where ■j{z) = (71(2;), . . . , 7m(-2)). Obviously, given the X and hence the 
1/1, ... , ym, the 7i, . . . , 7m are linear and continuous in z. Therefore all (y9 G $ 
are continuous linear functionals. We define the principal component depth 
as follows: 

Definition 4 (Principal component depth) 

PCD{z\X) = inf D' ((r,7(z))|(r,7(xi)), . . . , (r,7(a;"))) 
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Figure 8: Central regions PCD^, a e {0.02,0.10,0.15,0.20}; hip angle. 



Note that many multivariate depth notions, among them the location, zonoid 



and Mahalanobis depths, satisfy the weak projection property (DyckerhofF 



2004). In this case, it holds 

PCD{z\X) = (7(^)|7(x^), . . . , 7(x")) 



Proposition 7 The principal component depth satisfies FD2, FD4con, 
FD5, and slight variants of FDl, FD3 and FD4R, where h resp. z resp. 
r are restricted to linear combinations of the principal components. 



The proof is straightforward and left to the reader. We illustrate the PC- 
depth by applying it to the hip angle data. Figure 7 exhibits the data and its 
first three principal components, which are plotted as perturbations of the 
(pointwise) mean function. We neglect the third component and represent 
each function x by its bivariate component score (7i(x), 72(x)). Then the bi- 
variate halfspace depth is used to construct central regions in the score space 
(Figure 8) and, consequently, in the original data space (Figure 9). Similarly, 
Figure 10 shows outlying data of different maximum PC-depth. Comparing 
the central regions trimmed by PC-depth in Figure 9 with those trimmed by 
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Figure 9: Central regions PCDo,, a G {0.02,0.10,0.15,0.20}; hip angle. 



halfgraph depth in Figure 2, we observe that at level a = 0.02 both trim- 
mings provide the full data set, while at level a = 0.02 the PC-depth yields a 
larger region, which near the right border of the interval spreads significantly 
more out. This illustrates the different approaches: The PC-depth relates to 
the common principal components and measures centrality with respect to 
their loadings, while the halfgraph depth refers to the functional level of the 
data and indicates uniform centrality over the interval. 



7 Outlook 

Our definition [T] of a $-depth for functional data extends immediately to a 
population version, that is to a depth with respect to a probability distribu- 
tion on Banach space E. Note that the above postulates FDl to FD5 can 
be literally translated to the population setting. 

Definition 5 Let X he an E-valued random variable, and $ and as in 
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Figure 10: Outlying data having PC-depth below = 0.0256, ^ = 0.0512, 
t|j = 0.0769, and ^ = 0.1035, respectively; hip angle. 



Definition [1} The function 



D{z\X) = ini D'^{ip{z)\ip{X)) , zeE. 



(16) 



is a functional depth. 



It is easily seen that D{z\X) is well defined for all 2; G -E and that the 
functional depth satisfies the postulates FDl to FD5. More specifically, 
proving FDl, FD2, FD4, FD5 is straightforward. FD3 holds if ||;z|| — )■ 00 
implies that there exists (y9 G $ such that ip{z) — > 00. 

We have demonstrated that the •I'-depths form a comprehensive and fiexible 
class that satisfies the basic postulates of functional data depths and contains 
special notions for diverse applications. However, certain approaches from 
the literature are no $-depths. These are mainly of two types. The first 
type employs random projections of the data: [Cuesta-Albertos and Nieto^ 
Reyes (2008b) define the depth of a function as the univariate depth of the 



function values taken at a randomly chosen argument t. Cuesta-Albertos and 



Nieto- Reyes (2008a) propose the random Tukey depth, which is the minimum 
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univariate location depth of univariate projections in a number of random 
directions; with d-variate data the random Tukey depth converges almost 
surely from above to the Tukey depth. Cuevas et al. (2007) also employ a 
random projection method. The other type uses average univariate depths. 
Compared to this our definition may be mentioned as a 'uniform' depth: 



Fraiman and Muniz (2001) calculate the univariate depths of the values of 
a function and integrate them over the whole interval; this results in kind 
of 'average' depth. Claeskens et al. (2012) introduce a multivariate d > 



1 functional data depth, where they similarly compute a weighted average 
depth. The weight at a point reflects the variability of the function values 
at this point (more precisely: is proportional the volume of a depth trimmed 
region at the point). These notions satisfy the above basic postulates or 
proper modifications of them; but a detailed analysis of them is beyond the 
scope of this paper. 



Appendix 

Proof of Theorem [H 

(i): The function = {(pi{-), ■ ■ ■ , G $ is linear, since it is linear in 

every component. FDl and FD2 are obvious due to the linearity of (p and 
the affine invariance Dl and D2 of . 

To show FD4, assume that z* G f]^^^ip^^{Df{(p{X))), in particular, that 
this intersection is not empty. Then D'^{(f{z*)\(f{X)) = 1 for all ip, and 
therefore D{z*\X) = mf^(z^ D'^{ip{z*)\ip{X)) = 1. We conclude that z* is a 
D-deepest point in X. It holds 

D{z* + ar\X) = ini D'^{ip{z* + ar)\ip{X)) 

= inf D'^(^(^*) + a(^(r)|^(X)). 

Since D'^{ip{z*) + aip{r))\ip{x^), . . . ,{p{x"')) decreases with a > according 
to D4, FD4 is true. 

For FD5, note that 

= {z : D{z\X) > a} = {z : inf D'^{ip{z)\ip{X)) > a} 
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which, as an intersection of closed sets, is closed. 

(ii) : Obvious. 

(iii) : To show FD4con, assume y, z & Da{X), hence, for all 

D\^{y)\^{X)) > a and D''{^{z)\^{X)) > a . 

For every A g]0.1[ follows (due to the linearity of and D4C): 

D'^iifiXy + (1 - X)z)\if{X)) = D\\^{y) + (1 - A)^(z)|^(X)) > a . 

By taking the infimum, conclude that D{Xy + (1 — X)z\X) > a; therefore 
FD4con . 

Proof of Theorem [2| First assume that z E Da{X), i.e., 
D{z\X) = inf D'^{ip{z)\ip{X)) > a . 

Then, for every G obtain D''-{(p{z)\(p{X)) > a, hence (p{z) G D^{(p{X)), 
and therefore z G ip~^{D'^{ip{X))). Conclude 

D^{X)C f] V~\Dt{v{X))) . 

On the other hand, let z G f]^^^(p~^{D^{Lp{X))), which means that for all 
(/? G $ exists some y G D'^{(p{X)) with y = (p{z) and D'''{y\(p{X)) > a. 
Consequently, D{z\X) > a, and z G Da{X), which proves the reverse set 
inclusion, hence Equation Q. (> 

Proof of Theorem [3[ 'only if: Let G $. From Equation Q follows 
that DaiX) C ip-^{D^{ip{X))), and therefore 

^(D^iX)) C Dti^iX)) . (17) 

Now let y G D'^{ip{X)). Then, by the surjection property exists some z with 
ip{z) = y and 

Diz\X)=D''{y\^{X))>a. 

Hence z G Da{X), and therefore y = Lp{z) G (p{Da{X)). We conclude 
D^{ip{X)) C (p{DaiX)) and finally, with 0\ the claimed equality ^. 



26 



'if: Assume that Q holds and let y eR"^, a = D'^{y\ip{X)). Then, for all 

V9 G $ 

y e Di{^{X)) = ^{D^{X)) 

and there exists z e Da{X) so that y = ip{z). By this and the defining 
Equation ([T]) it follows that 

a < D{z\X) < D'^{ip{z)\ip{X)) = a , 

and therefore D{z\X) < D'^{ip{z)\ip{X)) = a, which proves the surjection 
property. 

Proof of Proposition [3[ For FDl to FD5 see Theorem [1} To show 
FD2F, let a & Eq and consider the componentwise multiplication of a with 
some z E E, 

(a ■ z){t) = ait) ■ z{t) = (ai(t)zi(t), . . . , ad{t)zd{t)f = A{t){z{t)) , 

where A{t) = diag{ai{t),a2{t), . . . ,ad{t)) is a regular matrix. Notate simi- 
larly (a ■ X){t) = A{t)X{t). Then, by D2, it holds D'^{A{t)z{t)\A{t)X{t)) = 
D'^{z{t)\X{t)) for every t eT and hence 

GD{a-z\a-X) = mfD'^{{A{t)z{t)\A{t)X{t)) 
= mfD'^{z{t)\X{t)) 
= GD{z\X). 

Next we demonstrate FD2R. Let p be a bijection on J, s = p{t). 

GDizop\x'op,...,x^op) = infD^(z(p(t))|xi(p(t)),...,x"(p(t))) 

= MD'{zis)\x\s),...,x-{s)) 
= GD{z\x\...,x''). 

Obviously, FD2IR follows. 

Proof of Proposition [5| For FDl to FD5 see Theorem [Tj Regarding 
FD2F consider 

a{t) 
a'{t) a{t) 



A{t) 



27 



for some a E E. Then 



\ x'{t) J~\ a'{t)x{t) + a{t)x'{t) J 



holds for every x & E. This imphes 
LSD{a ■ z\a ■ x^ , . . . ,a ■ x") 



= infD 



a-z){t) \ .( {a-x^){t) \ ( (a-x")(t) 



tiT \\{a- z)'{t) )\{a- x^yit) V (« • x^yit) 



= infD^ 

teT 



z{t) \ I ( x\t) \ ( x-{t) 

z'it) j 1 1 x^\t) r'\ x'"{t) 



= L5D(^|x\...,a;"), 

since the property D2 holds for the bivariate depth D"^. To show FD2R 
(and a fortiori FD2IR) define P{t) = \ \ u.\ ] ■ By assumption it holds 



p'{t) 

p'{t) 7^ 0. Then, due to D2 and p(-) being bijective, s = p{t), 
LSD{z o p\x^ o /),..., o p) 



'3^' \\ {z{p{t))y ) I V {x\pm' V {^"-{pm' 
tin' (pit) { 1 |P(t) { « y ( x^m) 

'M^' II z\p{t)) ) I V x^\p{t)) V x-\p{t)) 



z\p{t)) )'^'''\x^\p{t)) )---^^'\x-'{p{t)) 
<P{t)) \ , ( x\p{t)) \ ( x-{p{t)) 



inf L> 



z{s) \ I / x^{s) \ f x"'{s) 



sir \\z'{s) j^\x[{s) v<(«) 

= LSD{z\x\...,x''). 
This completes the proof. 
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